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Abstract. We propose new filtering algorithms for the SEQUENCE constraint 
and some extensions of the SEQUENCE constraint based on network flows. We 
enforce domain consistency on the SEQUENCE constraint in 0{n^) time down 
a branch of the search tree. This improves upon the best existing domain con- 
sistency algorithm by a factor of 0(log n). The flows used in these algorithms 
are derived from a linear program. Some of them differ from the flows used to 
propagate global constraints like Gcc since the domains of the variables are en- 
coded as costs on the edges rather than capacities. Such flows are efficient for 
maintaining bounds consistency over large domains and may be useful for other 
global constraints. 



1 Introduction 

Graph based algorithms play a very important role in constraint programming, espe- 
cially within propagators for global constraints. For example, Regin's propagator for 
the AllDifferent constraint is based on a perfect matching algorithm [1], whilst his 
propagator for the GCC constraint is based on a network flow algorithm [2]. Both these 
graph algorithms are derived from the bipartite value graph, in which nodes represent 
variables and values, and edges represent domains. For example, the Gcc propagator 
finds a flow in such a graph in which each unit of flow represents the assignment of 
a particular value to a variable. In this paper, we identify a new way to build graph 
based propagators for global constraints: we convert the global constraint into a linear 
program and then convert this into a network flow. These encodings contain several nov- 
elties. For example, variables domain bounds can be encoded as costs along the edges. 
We apply this approach to the SEQUENCE family of constraints. Our results widen the 
class of global constraints which can be propagated using flow-based algorithms. We 
conjecture that these methods will be useful to propagate other global constraints. 



2 Background 

A constraint satisfaction problem (CSP) consists of a set of variables, each with a finite 
domain of values, and a set of constraints specifying allowed combinations of values 
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for subsets of variables. We use capital letters for variables (e.g. X, Y and S), and 
lower case for values (e.g. d and di). A solution is an assignment of values to the vari- 
ables satisfying the constraints. Constraint solvers typically explore partial assignments 
enforcing a local consistency property using either specialized or general purpose prop- 
agation algorithms. A support for a constraint C is a tuple that assigns a value to each 
variable from its domain which satisfies C. A bounds support is a tuple that assigns 
a value to each variable which is between the maximum and minimum in its domain 
which satisfies C. A constraint is domain consistent {DC) iff for each variable X^, every 
value in the domain of Xi belongs to a support. A constraint is bounds consistent (BC) 
iff for each variable Xi, there is a bounds support for the maximum and minimum value 
in its domain. A CSP is DCIBC iff each constraint is DCIBC. A constraint is monotone 
iff there exists a total ordering -< of the domain values such that for any two values v, 
wiiv -<w then v is substitutable for w in any support for C. 

We also give some background on flows. A flow network is a weighted directed 
graph G = (F, E) where each edge e has a capacity between non-negative integers l{e) 
and w(e), and an integer cost w{e). A feasible flow in a flow network between a source 
(s) and a sink (t), (s, i)-flow, is a function f : E ^ Z+ that satisfies two conditions: 
/(e) e [^(e), u{e)], We € E and the flow conservation law that ensures that the amount 
of incoming flow should be equal to the amount of outgoing flow for aU nodes except 
the source and the sink. The value of a (s, t)-flow is the amount of flow leaving the sink 
s. The cost of a flow / is w(/) = J2eeE A minimum cost flow is a feasible 

flow with the minimum cost. The Ford-Fulkerson algorithm can find a feasible flow in 
0{(j>{f)\E\) time. If w{e) e Z, Ve e E, then a minimum cost feasible flow can be 
found using the successive shortest path algorithm in 0{(j){f)SPP) time, where SPP 
is the complexity of finding a shortest path in the residual graph. Given a (s, t)-flow / 
in G{V, E), the residual graph G/ is the directed graph (V, Ef), where Ef is 

{e with cost w{e) and capacity ^.(^(e) — /(e)) \ e = {u,v) G E, /(e) < w(e)} |^ 

{e with cost — 'w{e) and capacity 0..(/(e) - 1(e)) \ e = {u,v) e E, 1(e) < /(e)} 

There are other asymptotically faster but more complex algorithms for finding either 
feasible or minimum-cost flows [3]. 

In our flow-based encodings, a consistency check will correspond to finding a fea- 
sible or minimum cost flow. To enforce DC, we therefore need an algorithm that, given 
a minimum cost flow of cost w(/) and an edge e checks if an extra unit flow can be 
pushed (or removed) through the edge e and the cost of the resulting flow is less than 
or equal to a given threshold T. We use the residual graph to construct such an algo- 
rithm. Suppose we need to check if an extra unit flow can be pushed through an edge 
e = (u, v). Let e' = (u, v) be the corresponding arc in the residual graph. If w(e) = 0, 
Ve e E, then it is sufficient to compute strongly connected components (SCC) in the 
residual graph. An extra unit flow can be pushed through an edge e iff both ends of 
the edge e' are in the same strongly connected component. If w(e) G Z, Ve G E, the 
shortest path p between v and u in the residual graph has to be computed. The minimal 
cost of pushing an extra unit flow through an edge e equals w{f) + w(p) + w(e). If 
'w(f) + w(p) + w(e) > T, then we cannot push an extra unit through e. Similarly, we 
can check if we can remove a unit flow through an edge. 
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3 The Sequence Constraint 



The Sequence constraint was introduced by Beldiceanu and Contejean [4J. It con- 
strains the number of values taken from a given set in any sequence of k variables. It 
is useful in staff rostering to specify, for example, that every employee has at least 2 
days off in any 7 day period. Another appUcation is sequencing cars along a produc- 
tion line (probOOl in CSPLib). It can specify, for example, that at most 1 in 3 cars 
along the production line has a sun-roof. The Sequence constraint can be defined in 
terms of a conjunction of Among constraints. Among(I, u, [Xi, . . . , Xk], v) holds iff 
I < £ ''}| ^ That is, between I and u of the k variables take values in v. 

The Among constraint can be encoded by channelling into 0/1 variables using Yi <^ 
{Xi e v) and I < X^^Li Yi ^- Since the constraint graph of this encoding is Berge- 
acyclic, this does not hinder propagation. Consequently, we will simplify notation and 
consider Among (and Sequence) on 0/1 variables and v = {!}. If / = 0, Among 
is an AtMost constraint. AtMost is monotone since, given a support, we also have 
support for any larger assignment [5]. The Sequence constraint is a conjunction of 
overlapping AMONG constraints. More precisely, Sequence(/, u, k, [Xi, . . . , Xn],v) 
holds iff for 1 < i < n — fc -h 1, Among(?, u, [Xi, . . . , Xi+k-i],v) holds. A sequence 
likeXi, . . . ,Xi+k-i isawindow. It is easy to see that this decomposition hinders prop- 
agation. If Z = 0, Sequence is an AtMostSeq constraint. Decomposition in this case 
does not hinder propagation. Enforcing DC on the decomposition of an AtMostSeq 
constraint is equivalent to enforcing DC on the AtMostSeq constraint [5]. 

Several filtering algorithms exist for SEQUENCE and related constraints. Regin and 
Puget proposed a filtering algorithm for the Global Sequencing constraint (Gsc) that 
combines a Sequence and a global cardinaUty constraint (GCC) [6]. Beldiceanu and 
Carlsson suggested a greedy filtering algorithm for the CardPath constraint that can 
be used to propagate the Sequence constraint, but this may hinder propagation [7]. 
Regin decomposed Gsc into a set of variable disjoint Among and Gcc constraints [8]. 
Again, this hinders propagation. Bessiere et al. [5] encoded SEQUENCE using a Slide 
constraint, and give a domain consistency propagator that runs in 0{nd''~^) time, van 
Hoeve et al. [9] proposed two filtering algorithms that establish domain consistency. 
The first is based on an encoding into a REGULAR constraint and runs in 0(n2'=) time, 
whilst the second is based on cumulative sums and runs in 0{n^) time down a branch 
of the search tree. Finally, Brand et al. [10] studied a number of different encodings of 
the Sequence constraint. Their asymptotically fastest encoding is based on separation 
theory and enforces domain consistency in 0{n^ logn) time down the whole branch of 
a search tree. One of our contributions is to improve on this bound. 



4 Flow-based Propagator for the Sequence Constraint 



We will convert the Sequence constraint to a flow by means of a hnear program 
(LP). We shall use Sequence(Z, u, 3, [Xi, . . . , Xe\,v) as a rurming example. We can 
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fomulate this constraint simply and directly as an integer linear program: 

l<X2+X^+Xi<u, 

/ < X3 + X4 + Xg < U, 

l<Xi+Xr,+XQ<U 

where Xi e {0, 1}. By introducing surplus/slack variables, Yi and Zi, we convert this 
to a set of equalities: 



Xi + X2 + X^ 
X2 + Xs + X4 

X3 + X4 + X5 

X4 + X5 + Xq 



Y2 



/, X1+X2 + XS + Zi= u, 

I, X2 + Xs + Xi + Z2 = u, 

I, X3 + X4 + X5 + Z3 = U, 

I, X4 + X5 + Xe + Z4 = u 



where Yi, Zi > 0. In matrix form, this is: 



/111 
/111 
1 1 
1 1 
001 
001 
\ 
\o 



000 
000 







1 
1 
1 1 
1 1 



000 
100 
0-10 



-1 
1 










-1 



\ 



Yi 
Zi 



\2J 



This matrix has the consecutive ones property for columns: each column has a block 
of consecutive I's or — I's and the remaining elements are O's. Consequently, we can 
apply the method of Veinott and Wagner [11] (also described in Application 9.6 of [3]) 
to simplify the problem. We create a zero last row and subtract the ith row from i + 1th 
row for i = 1 to 2n. These operations do not change the set of solutions. This gives: 
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-1 





00 
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-1 


-1 


-1 
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Yi 
Zi 



l-u 
u — l 
l—u 
u—l 
l—u 



This matrix has a single 1 and —1 in each column. Hence, it describes a network 
flow problem [3] on a graph G = {V, E) (that is, it is a network matrix). Each row 
in the matrix corresponds to a node in V and each column corresponds to an edge in 
E. Down each colunm, there is a single row i equal to 1 and a single row j equal to 
-1 corresponding to an edge e .B in the graph. We include a source node s and 
a sink node t in V. Let h be the vector on the right hand side of the equation. If hi is 
positive, then there is an edge (s, i) G E that carries exactly bi amount of flow. If bi is 
negative, there is an edge {i,t) G E that caries exactly |6, | amount of flow. The bounds 
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on the variables, which are not expressed in the matrix, are represented as bounds on 
the capacity of the corresponding edges. 

The graph for the set of equations in the example is given in Figure 1. A flow 
of value 4u — 31 in the graph corresponds to a solution. If a feasible flow sends a 
unit flow through the edge labeled with Xi then Xi — 1 in the solution; otherwise 
Xi = 0. Each even numbered vertex 2i represents a window. The way the incoming 
flow is shared between yj and zj reflects how many variables Xi in the j'th window are 
equal to 1 . Odd numbered vertices represent transitions from one window to the next 
(except for the first and last vertices, which represent transitions between a window and 
nothing). An incoming X edge represents the variable omitted in the transition to the 
next window, while an outgoing X edge represents the added variable. 



X3[0.11 




Fig.l. A flow graph for Sequence(/, -u, 3, [Xi, . . . jXe], v) 



Theorem 1. For any constraint SEQUEt^CE{l , u, k. [Xi, . . . ,Xn],v), there isanequiv- 
alent network flow graph G = (V, E) with 5n — 4fc + 5 edges, 2n — 2fc + 3 + 2 
vertices, a maximum edge capacity of u, and an amount of flow to send equal to 
f = (n — k){u — I) + u. There is a one-to-one correspondence between solutions 
of the constraint and feasible flows in the network. 

The time complexity of finding a maximum flow of value / is 0{\E\f) us- 
ing the Ford-Fulkerson algorithm [12]. Faster algorithms exist for this prob- 
lem. For example, Goldberg and Rao's algorithm finds a maximum flow in 
0(mm(|Fp/^ \E\^^'^)\E\log(\V\'^/\E\ + 2)logC) time where C is tiie maximum 
capacity upper bound for an edge [13]. In our case, this gives 0{n'^^^ log n log u) time 
complexity. We follow Regin [1, 2] in the building of an incremental filtering algorithm 
from the network flow formulation. A feasible flow in the graph gives us a support for 
one value in each variable domain. Suppose Xk = v is in the solution that corresponds 
to the feasible flow where v is either zero or one. To obtain a support for = 1 — v, 
we find the SCC of the residual graph and check if both ends of the edge labeled with 
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Xk are in the same strongly connected component. If so, Xk = 1 — v has a support; oth- 
erwise 1 — V can be removed from the domain of Xk- Strongly connected components 
can be found in Unear time, because the number of nodes and edges in the flow network 
for the Sequence constraint is linear in n by Theorem 1. The total time complexity 
for initially enforcing DC is 0{n{{n — k){u — I) + u)) if we use the Ford-Fulkerson 
algorithm or 0(n'^/^ log n log u) if we use Goldberg and Rao's algorithm. 

Still following Regin [1,2], one can make the algorithm incremental. Suppose dur- 
ing search Xi is fixed to value v. If the last computed flow was a support for Xi = v, 
then there is no need to recompute the flow. We simply need to recompute the SCC in 
the new residual graph and enforce DC in 0{n) time. If the last computed flow is not 
a support for Xi = v, we can find a cycle in the residual graph containing the edge 
associated to Xi in 0{n) time. By pushing a unit of flow over this cycle, we obtain a 
flow that is a support for Xi = v. Enforcing DC can be done in 0{n) after computing 
the SCC. Consequently, there is an incremental cost of 0{n) when a variable is fixed, 
and the cost of enforcing DC down a branch of the search tree is O(n^). 



5 Soft Sequence Constraint 

Soft forms of the SEQUENCE constraint may be useful in practice. The ROADEF 2005 
challenge [14], which was proposed and sponsored by Renault, puts forward a vio- 
lation measure for the SEQUENCE constraint which takes into account by how much 
each Among constraint is violated. We therefore consider the soft global constraint, 
SoftSequence(/, u, k, T, [Xi, u). This holds iff: 

n-fe+l k-1 fc-1 

T > ^ max(; - Y,{Xi+j € v), ^.i^i+J & v) - u, 0) (1) 

i=l j=0 j=0 

As before, we can simphfy notation and consider SoftSequence on 0/1 variables and 

^, = {1}. 

We again convert to a flow problem by means of a linear program, but this time with 
an objective function. Consider SoftSequence(/, u, 3, T, [Xi, . . .,Xe],v). We intro- 
duce variables, Qi and Pi to represent the penalties that may arise from violating lower 
and upper bounds respectively. We can then express this SoftSequence constraint as 
foUows. The objective function gives a lower bound on T. 

4 

Minimize ^^(Pj -|- Qi) subject to : 

i=l 

Xi+X2 + X3-Yi+Qi= I, Xi + ^2 + X3 + Zi - Pi = u, 

X2+X3+X4-Y2 + Q2 = l, X2+X3 + X4 + Z2-P2=U, 
X3+Xi + X5-Y3 + Q3 = l, X3+Xi + X5 + Z3-P3=U, 
X4 + X5 + X6-Y4 + Q4 = l, X4 + X5 + X6 + Z3-P4 = U 

where Yi, Zi, Pi and Qi are non-negative. In matrix form, this is: 
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Minimize + Qt) subject to: 



/111 
/ill 
oil 
oil 

1 
1 
. i i i ^ 
\000111 





10 
10 
110 
110 
111 




10 
0-10 



1 








1 



Qi 

Pi 



If we transform the matrix as before, we get a minimum cost network flow problem: 



Minimize X]i=i(-Pi + Oi) subject to: 



/I 1 1000 -1 00000 
' 1 1 








-1 



-10 10 

0000000011000000 
-1 0010000 -1 -1 00000 
00000000001 
00 -1 00100000 
00000000000011 



-1 
1 1 



-1-10 

110 

1 000000 -1 -1 

-1-1 1 1 



u u u 
\ 



-1 
-1 0000000 -1 0000000 










Xe 
Yi 
Zi 



Yi 
Zi 

Qi 

Pi 



The flow graph G = iV, E) for this system is presented in Figure 2. Dashed edges 
have cost 1, while other edges have cost 0. The minimal cost flow in the graph corre- 
sponds to a minimal cost solution to the system of equations 

Theorem 2. For any constraint SOFTS EQUENCe(Z, u, k, T, [Xi, . . . , Xn],v), there is 
an equivalent network flow graph. There is a one-to-one correspondence between solu- 
tions of the constraint and feasible flows of cost less than or equal to max(dorn{T)). 

Using Theorem 2, we construct a DC filtering algorithm for the SoftSequence 
constraint. The SOFTS EQUENCE constraint is DC iff the following conditions hold: 

- value 1 belongs to dom{Xi), i = 1, . . . , n iff there exists a feasible flow of cost at 
most max{dom,{T)) that sends a unit flow through the edge labeled with Xi. 

- value belongs to dom{Xi), i = 1, . . . , n iff there exists a feasible flow of cost at 
most max(dom(T)) that does not send any flow through the edge labeled with Xi. 

- there exists a feasible flow of cost at most min (dom(T)). 

The minimal cost flow can be found in 0(|y||i?| loglogf/log |T^|C) = 
0{n^ lognloglogu) time [3]. Consider the edge {u,v) in the residual graph associ- 
ated to variable Xi and let fc(„,^) be its residual cost. If the flow corresponds to an 



l — U 

u—l 

l — U 

u—l 

l — U 

vr„7 
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Fig. 2. A flow graph for SoftSequence(Z, u, 3, T, [Xi, . . . , Xe]) 



assignment with Xi = 0, pushing a unit of flow on {u, v) results in a solution with 
Xi = 1. Symmetrically, if the flow corresponds to an assignment with Xi = 1, pushing 
a unit of flow on {u, v) results in a solution with Xi = 0. If the shortest path in the 
residual graph between v and u is fc(t, then the shortest cycle that contains {u,v) 
has length k(^u,v) + k{v,u) - Pushing a unit of flow through this cycle results in a flow of 
cost c + k(^u,v) + which is the minimum-cost flow that contains the edge {u, v). 
If c + fc(u.,,) + fc(,,,u) > m;%x{dom{T)), then no flows containing the edge {u, v) exist 
with a cost smaller or equal to max(dom(T)). The variable Xi must therefore be fixed 
to the value taken in the current flow. Following Equation 1, the cost of the variable T 
must be no smaller than the cost of the solution. To enforce BC on the cost variable, we 
increase the lower bound of dom(T) to the cost of the minimum flow in the graph G. 

To enforce DC on the X variables efficiently we can use an all pairs shortest path 
algorithm on the residual graph [15]. This takes 0{'n? logn) time using Johnson's al- 
gorithm [12]. This gives an 0(n^ log n log log u) time complexity to enforce DC on 
SoftSequence. The penalty variables used for SoftSequence arise directly out of 
the problem description and occur naturally in the LP formulation. We could also view 
them as arising through the methodology of [16], where edges with costs are added to 
the network graph for the hard constraint to represent the softened constraint. 

6 Generalized Sequence Constraint 

To model real world problems, we may want to have different size or positioned win- 
dows. For example, the window size in a rostering problem may depend on whether 
it includes a weekend or not. An extension of the SEQUENCE constraint proposed in 
[9] is that each Among constraint can have different parameters (start position, u, 
and k). More precisely, Gen-Sequence(pi, . . . ,Pt7i, [Xi,X2,.. .,Xn],v) holds iff 
KMOKG{l,,u^,k.,,[Xs^,...,Xsi+ki-i\,v)foxl < i < m where Pi = {li,Ui, h, Si). 
Whilst the methods in Section 4 easily extend to allow different bounds I and u for each 
window, dealing with different windows is more difficult. In general, the matrix now 
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does not have the consecutive ones property. It may be possible to re-order the windows 
to achieve the consecutive ones property. If such a re-ordering exists, it can be found 
and performed in 0{m + n + r) time, where r is the number of non-zero entries in the 
matrix [17]. Even when re-ordering cannot achieve the consecutive ones property there 
may, nevertheless, be an equivalent network matrix. Bixby and Cunningham [18] give a 
procedure to find an equivalent network matrix, when it exists, in 0{mr) time. Another 
procedure is given in [19]. In these cases, the method in Section 4 can be appUed to 
propagate the Gen- Sequence constraint in O(n^) time down the branch of a search 
tree. 

Not all Gen-Sequence constraints can be expressed as network flows. Consider 
the Gen- Sequence constraint with n — 5, identical upper and lower bounds (1 and 
u), and 4 windows: [1,5], [2,4], [3,5], and [1,3]. We can express it as an integer linear 
program: 



Applying the test described in Section 20. 1 of [ 1 9] to Example 2, we find that the matrix 
of this problem is not equivalent to any network matrix. 

However, all Gen- Sequence constraint matrices satisfy the weaker property of 
total unimodularity. A matrix is totally unimodular iff every square non-singular sub- 
matrix has a determinant of -|- 1 or — 1 . The advantage of this property is that any totally 
unimodular system of inequalities with integral constants is solvable in Z iff it is solv- 
able in R. 

Theorem 3. The matrix of the inequalities associated with Gen-Sequence constraint 
is totally unimodular 

In practice, only integral values for the bounds U and Ui are used. Thus the consis- 
tency of a Gen- Sequence constraint can be determined via interior point algorithms 
for LP in 0{n^-^ \ogu) time. Using the failed literal test, we can enforce DC at a cost 
of 0(n^-^ logu) down the branch of a search tree for any Gen-Sequence constraint. 
This is too expensive to be practical. We can, instead, exploit the fact that the matrix for 
each Gen-Sequence constraint has the consecutive ones property for rows (before 
the introduction of slack/surplus variables). Corresponding to the row transformation 
for matrices with consecutive ones for columns is a change-of-variables transformation 
into variable Sj = Ylil=i -^i matrices with consecutive ones for rows. This gives the 
dual of a network matrix. This is the basis of an encoding of Sequence in [10] (de- 
noted there CD). Consequently that encoding extends to Gen-Sequence. Adapting 
the analysis in [10] to Gen-Sequence, we can enforce DC in 0{nm + v? log n) time 
down the branch of a search tree. 

In summary, for a compilation cost of 0{mr), we can enforce DC on a 
Gen-Sequence constraint in O(n^) down the branch of a search tree, when it has 
a flow representation, and in 0{nm + log n) when it does not. 




(2) 
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7 A SlidingSum Constraint 



The SlidingSum constraint [20] is a generalization of the Sequence constraint 
from Boolean to integer variables, which we extend to allow arbitrary windows. 
SlidingSum ([Xi, . . . , X„], [pi, . . . holds iff k < E'L^*""^ < Ui holds 
where pi = {li, Ui, ki, Si) is, as with the generalized SEQUENCE, a window. The con- 
straint can be expressed as a linear program V called the primal where is a matrix 
encoding the inequalities. Since the constraint represents a satisfaction problem, we 
minimize the constant 0. The dual V is however an optimization problem. 



min 



" w ' 




' I ' 


-w 


X > 


—u 


I 




a 


-I 




-5 



min \—l u —ab\Y 

\ V lyV^ -w^ I -i]Y = o 

Y >0 



}V (3) 



Von Neumann's Strong Duality Theorem states that if the primal and the dual prob- 
lems are feasible, then they have the same objective value. Moreover, if the primal is 
unsatisfiable, the dual is unbounded. The SlidingSum constraint is thus satisfiable 
if the objective function of the dual problem is zero. It is unsatisfiable if it tends to 
negative infinity. 

Note that the matrix W'^ has the consecutive ones property on the columns. The 
dual problem can thus be converted to a network flow using the same transformation as 
with the Sequence constraint. Consider the dual LP of our running example: 



Minimize - J2i=i + Y^i=i ^i^i 



4+i 



h;Y: 



subject to: 



'10 1-1 
110 1-1 

1111-1 

1110-1 
.10 10-1 



-1 1 
-10 10 
-10 10 
10 
1 



-1 
-1 







Our usual transformation will turn this into a network flow problem: 



Minimize - ^^^=1 hYi + Yh=i UiY^+i - Yl^^i caYs+i + Ei=i hYia+i subject to: 



1 00 1 -1 00 -1 1 0000 -1 0000\ 

1 000 -1 00 -1 1 000 1 -1 00 \ / Yi 

1 -1 -1 1 1 -1 |( 

-1 1 -1 1 1 -1 I • 

-1 1 -1 1 i-i/Vris 

-1 -1 1 1 00000 -1 0000 1/ 



The flow associated with this example is given in Figure 3. There are n + 1 nodes 
labelled from 1 to n + 1 where node i is connected to node i + 1 with an edge of cost 
— Oj and node z -|- 1 is connected to node i with an edge of cost 6j. For each window pi, 
we have an edge from Si to Si + ki with cost —U and an edge from Sj + ki to Si with 
cost Ui. All nodes have a null supply and a null demand. A flow is therefore simply a 
circulation i.e., an amount of flow pushed on the cycles of the graph. 
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Fig. 3. Network flow associated to the SlidingSum constraint posted on the running example. 

Theorem 4. The SlidingSum constraint is satisfiable if and only there are no nega- 
tive cycles in the flow graph associated with the dual linear program. 

Proof. If there is a negative cycle in the graph, then we can push an infinite amount of 
flow resulting in a cost infinitely small. Hence the dual problem is unbounded, and the 
primal is unsatisfiable. Suppose that there are no negative cycles in the graph. Pushing 
any amount of flow over a cycle of positive cost results in a flow of cost greater than 
zero. Such a flow is not optimal since the null flow has a smaller objective value. Push- 
ing any amount of flow over a null cycle does not change the objective value. Therefore 
the null flow is an optimal solution and since this solution is bounded, then the primal 
is satisfiable. Note that the objective value of the dual (zero) is in this case equal to the 
objective value of the primal. □ 

Based on Theorem 4 we build a BC filtering algorithm for the SlidingSum con- 
straint. The SlidingSum constraint is BC iff the following conditions hold: 

- value fli is the lower bound of a variable Xi, i = 1, . . . , n iff aj is the smallest 
value in the domain of X; such that there are no negative cycles through the edge 
weighted with — and labeled with the lower bound of Xi. 

- value bi is the upper bound of a variable Xi, i = 1, . . . , n iff 6i is the greatest 
value in the domain of Xi such that there are no negative cycles through the edge 
weighted with bi and labeled with the upper bound of Xi 

The flow graph has 0{n) nodes and 0{n + m) edges. Testing whether there is a 
negative cycle takes 0{n^ + nm) time using the Bellman-Ford algorithm. We find for 
each variable Xi the smallest (largest) value in its domain such that assigning this value 
to Xi does not create a negative cycle. We compute the shortest path between all pairs 
of nodes using Johnson's algorithm in 0(|y p log \V\ -\- \ V\\E\) time which in our case 
gives 0{n^ \ogn + nm) time. Suppose that the shortest path between i and i + 1 has 
length s{i,i+l), then for the constraint to be satisfiable, we need bi + s{i,i+l) > 0. 
Since bi is a value potentially taken by Xi, we need to have Xi > —s{i,i-\-l). We 
therefore assign min((iom(Xi)) ^ max(min((iom(Xi)), —s(i,i + 1)). Similarly, let 
the length of the shortest path between i + 1 and ih& s{i+ 1, z). For the constraint to 
be satisfiable, we need s{i -\- — ai > 0. Since aj is a value potentially taken by 
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Xi, we have Xi < s{i + 1, i). We assign max{Xi) min(max(Xi), s{i + 1, i)). It 
is not hard to prove this is sound and complete, removing all values that cause negative 
cycles. Following [10], we can make the propagator incremental using the algorithm 
by Cotton and Maler [21] to maintain the shortest path between \P\ pairs of nodes in 
0(|-E| + \ V\ log \ V\ + \P\) time upon edge reduction. Each time a lower bound is 
increased or an upper bound bi is decreased, the shortest paths can be recomputed in 
0{m + n log n) time. 

8 Experimental Results 

To evaluate the performance of our filtering algorithms we carried out a series of ex- 
periments on random problems. The experimental setup is similar to that in [10]. The 
first set of experiments compares performance of the flow-based propagator FB on 
single instance of the SEQUENCE constraint against the HPRS propagator^ (the third 
propagator in [9]), the CS encoding of [10], and the Among decomposition {AD) of 
Sequence. The second set of experiments compares the flow-based propagator FBs 
for the SoftSequence constraint and its decomposition into soft Among constraints. 
Experiments were run with ILOG 6. 1 on an Intel Xeon 4 CPU, 2.0 Ghz, 4G RAM. Boost 
graph library version 1.34.1 was used to implement the flow-based algorithms. 

4 = 1 , k -7 4 = 5, k =7 




The number of variables (n) The number of variables (n) 



Fig. 4. Randomly generated instances witli a single SEQUENCE constraints for different combinations of A and k. 

8.1 The Sequence constraint 

For each possible combination of n e {500,1000,2060,3000,4000,5000}, k G 
{5, 15, 50}, A — u — I e {1,5}, we generated twenty instances with random lower 
bounds in the interval (0, k — A). We used random value and variable ordering and a 
time out of 300 sec. We used the Ford-Fulkerson algorithm to find a maximum flow. Re- 
sults for different values of A are presented in Tables 1, 2 and Figure 4. Table 1 shows 
results for tight problems with A ~ 1 and Table 2 for easy problems with A ~ 5. 

' We would like to thank Willem-Jan van Hoeve for providing us with the implementation of 
the HPRS algorithm. 
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To investiage empirically the asymptotic growth of the different propagators, we plot 
average time to solve 20 instances against the instance size for each combination of 
parameters k and A in Figure 4. First of all, we notice that the CS encoding is the best 
on hard instances ( A = 1) and the AD decomposition is the fastest on easy instances 
(A = 5). This result was first observed in [10]. The FB propagator is not the fastest 
one but has the most robust performance. It is sensitive only to the value of n and not to 
other parameters, Uke the length of the window(fc) or hardness of the problem(Z\). As 
can be seen from Figure 4, the FB propagator scales better than the other propagators 
with the size of the problem. It appears to grow linearly with the number of variables, 
while the HPRS propagator display quadratic growth. 

Table 1. Randomly generated instances with a single Sequence constraint and A = \. Number of instances solved in 
300 sec / average time to solve. We omit results for n e {1000, 3000, 4000} due to space limitation. The summary rows 
include all instances. 



n k 


AD 


CS 


HPRS 


FB 


500 7 
15 
50 


8 /2.13 
6 /O.Ol 
2 /0.02 


20 / 0.13 
20 / 0.09 
20 / 0.07 


20 /0.35 
20 / 0.30 
20 / 0.26 


20 /0.30 
20 / 0.29 
20 /0.28 


2000 7 

15 
50 


4 /0.04 

/O 

1 A) 


20 / 4.25 
20 / 1.84 
20 / 1.16 


20 / 18.52 
20 / 15.19 
20 / 13.24 


20 / 4.76 
20 /4.56 
20 / 4.42 


5000 7 
15 
50 


1 A) 
/O 
/O 


20 / 64.05 
20 / 24.46 
20 / 8.24 


15 / 262.17 
17 /211.17 
19 / 146.63 


20 / 36.09 

20 / 34.59 
20 / 31.66 


TOTALS 
solved/total 
avg time for solved 
avg bt for solved 


37 /360 
0.517 
17761 


360 /360 
9.943 

429 


351 /360 
60.973 



360 /360 
11.874 




Table 2. Randomly generated instances with a single Sequence constraint and ^ — 5. Number of instances solved in 
300 sec / average time to solve. We omit results for n e { 1000, 3000, 4000} due to space limitation. The summary rows 
include all instances. 



n k 


AD 


CS 


HPRS 


FB 


500 7 
15 
50 


20 / 0.01 
20 / 0.01 

18 /0,02 


20 70,58 
20 70,69 
20 / 0.20 


20 70,15 
20 70,25 
20 70,37 


20 70.44 
20 70,44 
20 70,42 


2000 7 
15 
50 


20 / 0.07 
20 / 0.07 

5 /5,19 


20 7 32,41 
20 7 39,71 
20 79,52 


20 77,19 
20 714,89 
20 7 13,71 


20 76,62 
20 76,63 
20 7 6.94 


5000 7 
15 
50 


20 / 0.36 
20 / 0.36 

9 /0,48 


70 
6 7 160.99 
20 7 108,34 


20 7109.18 
17 7215.97 
11 7210.53 


20 746.42 
20 745.97 
20 7 44.88 


TOTALS 

solved/total 
avg time for solved 
avg bt for solved 


296 /360 
0.236 
888 


308 7360 
52.708 
1053 


345 7360 
50.698 



360 7360 
16.200 




8.2 The Soft Sequence constraint 

We evaluated performance of the soft Sequence constraint on random problems. For 
each possible combination of n e {50, 100}, k e {5, 15, 25}, Z\ = {1, 5} and m G {4} 
(where m is the number of SEQUENCE constraints), we generated twenty random in- 
stances. All variables had domains of size 5. An instance was obtained by selecting ran- 
domlower bounds in the interval (0, A; — We excludedinstances where J Zi > k 
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to avoid unsatisfiable instances. We used a random variable and value ordering, and a 
time-out of 300 sec. All Sequence constraints were enforced on disjoint sets of cardi- 
nality one. Instances with Zi = 1 are hard instances for Sequence propagators [10], so 
that any DC propagator could solve only few instances. Instances with Z\ = 5 are much 
looser problems, but they are still hard do solve because each instance includes four 
overlapping SEQUENCE constraints. To relax these instances, we allow the Sequence 
constraint to be violated with a cost that has to be less than or equal to 15% of the 
length of the sequence. Experimental results are presented in Table 3. As can be seen 
from the table, the FBs algorithms is competitive with the decomposition into soft 
Among constraints on relatively easy problems and outperforms the decomposition on 
hard problems in terms of the number of solved problems. 

We observed that the flow-based propagator for the SoftSequence constraint 
(FBs) is very slow. Note that the number of backtracks of FBs is three order of mag- 
nitudes smaller compared to ADs- We profiled the algorithm and found that it spends 
most of the time performing the aU pairs shortest path algorithm. Unfortunately, this is 
difficult to compute incrementally because the residual graph can be different on every 
invocation of the propagator. 

Table 3. Randomly generated instances with 4 soft Sequences. Number of instances solved in 300 sec / average time to 
solve. 





A = l 


A = 5 


n k 


ADs 


FBs 


ADs 


FBs 


50 7 

LS 
25 


6 / 19.30 

8 / 36.(17 
6 / 0.73 


7 / 27.91 
13 / 20.41 
10 / 23.27 


20 / 0.01 
11 / 49.49 
10 / 6.40 


20 /2.17 
10 / 30.51 
10 /7.41 


100 7 
15 
25 


1 /() 

/o 
/o 


3 / 7.56 
5 / 6.90 
5 / 4.96 


19 / 10.50 
3 / 0.01 
5 / 19.07 


18 / 16.51 
3 /7.20 
5 / 23.99 


TOTALS 

solved/total 
avg time for solved 
avg bt for solved 


21 /120 
19.463 
245245 


43 /120 
18.034 
343 


68 /120 
13.286 
147434 


66 /1 20 
13.051 
128 



9 Conclusion 

We have proposed new filtering algorithms for the Sequence constraint and several 
extensions including the soft Sequence and generahzed Sequence constraints which 
are based on network flows. Our propagator for the SEQUENCE constraint enforces do- 
main consistency in O(n^) time down a branch of the search tree. This improves upon 
the best existing domain consistency algorithm by a factor of O(logn). We also intro- 
duced a soft version of the Sequence constraint and propose an 0(n^ log n log log u) 
time domain consistency algorithm based on minimum cost network flows. These al- 
gorithms are derived from hnear programs which represent a network flow. They differ 
from the flows used to propagate global constraints like Gcc since the domains of the 
variables are encoded as costs on the edges rather than capacities. Such flows are ef- 
ficient for maintaining bounds consistency over large domains. Experimental results 
demonstrate that the FB filtering algorithm is more robust than existing propagators. 
We conjecture that similar flow based propagators derived from linear programs may 
be useful for other global constraints. 
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